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Abstract 

VIZIR provide* an (Migrated mechanism for an-lln* dc> 
bugging, performance analysis, and data visualltfltton of 
messag&passing parallel applications. The current VtZlH 
includes; (I) a mechanism to multicast wlndow*bascd com- 
mands, message passing and prvgram execution events 
among tasks; (2) the ability to select groups of tasks to in- 
teract these events; 0) a means to report when these events 
cause errtfrt in the tasks: and (4) ad hoc visualization of 
distributed arrays using existing visualizers. 

1.0 Introduction 



Distributed computing is emerging as a realistic and cost- 
effective approach lo parallel processing. This trend is 
becoming popular, largely duo to the advances in network* 
ing of heterogenous computing resources thlt allow the 
U«r* to use the network as a parallel machine. Several 
users of conventional parallel supercomputers now con- 
sider such distributed parallel architectures as an alterna- 
tive for their applications. However, despite the immense 
potential of network-based distributed computing, the 
existing program development tools for such systems are 
still primitive in terms of their efficacy. Moreover, the 
problems involved in developing message-passing pro- 
grams for any parallel architecture are nocr-trfvial. Visual- 
izing the program behavior has proved to be a useful 
technique for debugging and analysing the concurrent 
behavior of parallel programs but the synchronous nature 
of distributed systems make the task even more challeng- 
ing. VIZIR is an integrated visualization and debugging 
environment that has been developed at Hewle^packard 
Laboratories to meet with these challenges of tool devel- 
opment and to facilitate the ask of distributed program- 
ming. Its main objective is to use the strengths of network- 
based computing for sea^endai/durnbuted programs and 
to apply the existing technologies and tools to solve the 
problems of distributed program development. Due to the 
nature of the specific issues involved in developing distrib- 
uted programs as opposed to the development of sequen- 



tial programs, the manner in which these existing tools and 
technologies arc applied is the key to solve several tool 
development problems. The design of VIZIR addresses 
these issues to enable the users to use their favorite debug- 
gins tools for debugging their distributed programs. Simi- 
larly, it enables the users to add multiple visualization 
tools of their choice lo VIZIR environment to understand 
the dynamic execution behavior of their applications. 

Tool development for programming parallel and distrib- 
uted systems has been an active area of research but has 
largely fallen short of the user expectation? [10]. In many 
cases users arc not satisfied with the way a particular tool 
works and the learning curve associated with using it to 
accomplish a specific tusk. VIZIR allows the user to use 
his/her favorite tool by making it a port Of the program- 
ming environment. In order to realize this environment, 
following issues are involved: 

1. Enabling the programmer's favorite, existing debug- 
ging and visualization tools to become apart of this 
program development environment. This has become 
an important issue in view of increasing indifference of 
the users toward "novel" parallel program develop- 
ment tools. 

2. Synchronizing and controlling the activities of the 
debuggers and visualizer to provide a consistent view 
of program execution to the user. 

3. Integrating heterogenous types of visualization tools, 
such as general-purpose and conventional performance 
visualization tools across different platforms. This is 
necessary to address a wide range of requirements of 
various types of program behavior visualizations, such 
as application performance, system performance, and 
program data visualizations. 

We have addressed these issues in the design of VlZiR to 
assist the users to develop distributed programs using the 
PVM [4 J niesasge-passing library for a duster of worksta- 
tions. We have enabled several commonly used debugging 
tools, such as Softbench's Softdebug, Hewlett Packard's 
DDE. and IBM'S XD&, Similarly, we have integrated pop* 
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ularly used parallel program visualization tools such as 
ParaGraph in our System, in addition to commercially 
available visualization tools, such as Mailab and Gnuploc 
for customized program performance and data visualiza- 
tions. This paper depicts the Architecture and the main fea- 
tures of the VIZIR. 



2,0 Architecture of VIZIR 

In a distributed programming environment, an application 
consists of multiple processes running on one or more 
physical nodes thai arc distributed in a network. VIZIR 
executes each of these application processes under the 
conLrol of an available (and perhaps user's favorite) 
debugger. One debugger executing a single process pre- 
sents the same scenario as debugging a single sequential 
application. The only difference is the me si age- passing 
among these otherwise independent processes. Visualiza- 
tion has been recognized as an appropriate technique to 
represent message-passing and program execution behav- 
ior [8]. Several tools have been developed and used to rap- 
resent Various aspects of concurrent program and system 
behavior [9], VIZIR enables the use of these visualization 
too) on-line by providing three major functionalities: 

1. multicasting menage-passing events among tbe appli- 
cation processes; 

2. integrating visualization tools to represent multiple 
perspectives of application behavior, and 

3. controlling and synchronizing the execution of applica- 
tion processes and visualization tools. 

Figure 1 depicts the overall architecture of VIZIR and its 
functionality. Despite the distributed processes, the envi- 
ronment allows the user to control the configuration and 
actions of all the distributed application processes and 
tools. It is important to note that VIZIR is running locally, 
whereas the other application processes, debuggers, and 
visualization tools might be running locally or remotely. 
Therefore, VIZIR acts as a controller for (he whole envi- 
ronment which is the key to resolve tbe problems involved 
in visualizing distributed application programs. Wc 
present the major functions of VIZIR related with distrib- 
uted program visualization in the following. 

2.1 Multicasting Process Events 

Parallel program visualization tools have to rely On some 
mechanism for multicasting events among the concurrent 
proccsscft, in Order to represent this activity graphically. 
Program code is instrumented and linked with the avail, 
able communication library to multicast these communicz- 
tion events. However, most of the instrumentation system 



for parallel programs in general (such as PICL [3]) and 
distributed programs in particular (such as Xab [I J) per- 
turb the application behavior mainly due to additional 
mess age- passirig required for generating and communicat- 
ing the trace data. In the case of Xab, distributed programs 
using pVM message-passing library need to use the same 
message-passing calls to accumulate the trace data. VIZIR 
docs not need Such explicit message-pacing which might 
double the actual communication cost of an application. 
Instead, it relies on a Multiple Event Protocol (MEP |7|) to 
multicast message-passing and program execution events 
among the application processes. MEP uses interact ten I 
communication primitives that incur less overhead than 
using PVM calls to receive these events. Additionally, 
there is no explicit binding between application and 
VIZIR'S multiple event processing and synchronization 
activities. Whenever an application process executes a 
particular message-passing function, it sends that event to 
the event queue of underlying System which can be trig- 
gered by VIZIR. Once VIZIR receives the event, it assigns 
the event a time-stamp and generates a corresponding 
trace record. This trace record can be further processed 
and passed on to the visualization tools to dynamically 
visuaJifcc the application behavior. 




FIGURE 1. Arehheaiyt«f ViZUX tfistrftiicd 
dCVttepntnil ud TiaializAtian 



2.2 Integrating Multiple loots 

Tool integration is a welNknown problem in software 
engineering and there is an on-going standardization effort 
to develop more practical frameworks for this purpose [2]. 
More recently, it has found its way into the design of tools 
for parallel programming because it is difficult for a single 
tool to satisfy nil the requirements of all the users [12). In 
order for the tool integration to be useful in case of parallel 
program development, it should meet two requjrentenu; 
There should (ideally) be no dependence between the 
internal semantics of a tool and tbe rest of the environ- 
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mail, to ensure generality of the design and to avoid any 
proclaim related with the issues of overall performance 
and portability. 

4. It should be possible for die environment to pass neces- 
sary performance data and the desired actions to be 
taken oo that data by the tool. 

A tool interface fTI) was designed to specifically mtci the 
above two requirements. The functionality of the TI is 
depicted by Figure 2. A* shown in the SgUrc, each visual- 
ization too] which is to be integrated into the rest of the 
environment needs an interface. This interface is used for 
two specific purposes: (I) receiving data and control infor- 
mation from VIZIR; and (2) forwarding this data and con- 
trol information to the particular tool that Oft interface is 
responsible for. The interface converts the control infor- 
mation in a form which it in accordance with the seman- 
tics of that particular tool. Optional bi-directional 
communication is supported by the interface for synchro- 
nization purposes, 

VIZIR simplifies the Issues involved in tool integration. It 
provides a common interface to obtain user input 10 con- 
trol visualization tools as well as the rest of the environ- 
ment VIZIR sends the trace records to the visualizes as 
soon as tbey am generated to provide on-line visualization 
of dynamic program behavior. It can also store the trace 
data On the local disk as a trace fife for the purpose of pro- 
gram replay [5], 
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13 Controfling and Synchronizing 

Controlling and synchronizing are perhaps the most 
important functions of VIZIR that allow the integration of 
various visualization and debugging tools to work as one 
programming environment for the user. User can start or 
stop the execution of the entire distributed application 
using the control mechanism provided by VIZIR through 
in? GUI. Application processes are executed under the con- 
trol of debuggers and VIZIR sends the control commands 
to the multiple debuggers Using Event Stnje Protocol 
(ESP [6]). ESP provides a mechanism to multicast win- 
dow baaed commands from single COfilrul window to some 
subset of debuggers and visuahzera on various processes. 
It allows VIZIR to control the debuggers without any 
binding between the two. Therefore, VIZIR environ meat 
is efficient, flexible and extensible. Application processes 
and visualization took can Optionally be synchronized 
with the VIZIR to ensure the consistency between applica- 
tion behavior and visualization displays. Synchronization 
has been kept as an Option because it is bound to slow 
down the execution of application processes. 

3.0 Features of V12IR 

The design features of VIZIR presented io Section 2 have 
been used to provide several distributed program visual- 
ization features. Efficacy of the presently available visual- 
ization tools for distributed programs is limited due to the 
unavailability of these features, This section briefly pre- 
senta some of these features. 

3.1 ConBfeteni Tbm Stamps 

Generation of consistent time stamp baa been a non-trivial 
cojutfraint for existing distributed program visualization 
tools [I]- VIZIR overcomes this problem due to its archi- 
tecmre that relies on MEP and mokes it a global controller 
for the whole environment When the applications are run- 
ning on different workstations, it U not possible to have a 
notion of consistent global time. VIZIR docs not have the 
application processes to generate din time-nump* with the 
events. Instead, it only collects and orders the information 
about event occurrences through MEP and togs the time* 
stamps. VIZIR orders all incoming events before generat- 
ing time-stamps, ensuring that they are consistent. Also, 
there is negligible delay between a communication call 
and triggering that event using MEP, therefore, the time- 
stamps arc reliable for all practical purposes, 
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3.2 Ort-the-RyVisual«atJon 
VIZIR can send Die trace records to the visualization 
tools, immediately after (hey arc generated by aligning 
time-stamps. Mostly, visualisation tool* iuch as Para- 
Graph process one trace record at a lime. As soon as a 
new trace record is received, all the selected displays are 
updated by the tool. This process ii facilitated by (he 
tool interfaces that were presented in Section 2. 






Event domain views using Matlab a* an analytic 
and visualization engine; Integrated with VIZIR* 



VIZIR provide* control and 
tyncfanwittarlon of (he environment 




FIGURE 3. Multiple views from multiple domains 
and tools to vtouaifec mod analyze on- 
to** execution behavior Of a 

distributed program. 
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l>ptca] application visualization displays from 
ParaGraph integrated with VIZIR 
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34 Muttfpj* View* using Muftfplft Tools 

Visualizing the behavior and performance of » p^illel or 
distributed program it always a mutti-dimenslona) assess- 
ment Not only docs it take multiple tod* but also multiple 
views and multiple-domain* (II] are needed eq represent a 
complete picture of program behavior to the usee VlZiR 
provides typical performance visualization and animation 
views that are implemented in ParaGraph. It can also 
allow the use of geraral-ptirpos© data analysis tools such 
as AVS* Matlab, Onuplot, Mathematics, and so on to rep- 
resent multiple perspectives on application performance 
and behavior, figure 3 represents some of these views. 



4-0_ Condngjons 

VIZIR is an on-going experiment in Hewlett-Packard 
Research Ubs. VIZIR provides standard window inter- 
faces to existing visualizcrs/debuggen with on-the-fly 
control and synchronization. Processes in the parallel 
application can be halted by the debugger at the same 
point that performance and data visualization is being 
done. In addition* For example, performance and data 
errors detected by performance and data analyzers may 
aummalically trigger the debugger to halt the process. 

Although we have applied our mechanisms to prototype 
debugging environment for parallel programs, they have 
much wider applicability, This approach can be used any- 
time we want to do the same thing on more Ulan one 
machine. Examples include administering cluster, snaring 
large volume of data such as video. 
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Most people think that collaboration implies that several people 
are sharing work on a single application with shared displays. In 
fact, collaboration is more. It includes the concurrent control of 
multiple applications by a collaborative' group. To enable this 
more powerful form of collaboration, we show how to combine 
earlier mechanisms for single client, multiple server computing 
with a new mechanism called ESP (Event Sense Protocol) for 
multiple client, single server computing. We describe two 
extended examples — a working prototype of a multi-user, heter- 
ogeneous, parallel debugger and a commercial banking applica- 
tion. 
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1.0 Introduction 



Today's collaborative computing environments do not address collaboration using multiple appli- 
cations executing concurrently. There are many example of systems that allow collaboration using 
a single application on a single data repository. For examples, HP's SharedX [3] product allows 
sharing of an X protocol based application among users in a distributed computing environment. 
MMConf [4] provides the Diamond Multimedia conferencing System. BBN's Slate [5] allows 
users to collaborate on document development. However, all the above conferencing systems are 
limited to sharing a central copy of an application. There is no mechanism to concurrently control 
more than one application or more than one data repository. The missing piece is the subject of 
this presentation. 

We have developed a single server, multiple client environment. ESP, Event Sense Protocol 1 , 
allows a user to control several clients (applications) simultaneously. A key feature of our system 
is that any existing application can be used with no modification of any kind. For example, ESP 
enables us to update multiple copies of a Lotus spreadsheet by entering the commands once. In 
fact, the applications being controlled need not be running on machines of the same architecture 
or even be identical applications. ESP allows us to control Lotus running on HP and Sun worksta- 
tions and Excel on an IBM system by typing commands once. The only requirement is that the 
commands typed be meaningful to each system. 

This kind of multiple application, multiple data repository collaboration has many uses. A corpo- 
rate financial officer could update divisions' independently held spreadsheets to reflect changes in 
Federal tax law and leave the local revenue numbers unchanged; division heads may change local 
numbers concurrently. A secretary could change the boilerplate part of standard letters used by all 
the Company's branch offices. 

Combining the conventional single client, multiple server collaborative environment with the sin- 
gle server, multiple client model of ESP completes the picture by providing for multiple applica- 
tion collaboration. Application experts in different locations could simultaneously control various 
aspects of a distributed application running on several machines. An instructor teaching Frame- 
Maker could control many student documents simultaneously to illustrate a particular point while 
still allowing the students to work on their own reports and commenting on their content. Systems 
support people could work with users to improve distributed applications. 

The next section describes ESP, a mechanism to enable concurrent application control. Section 3 
shows how ESP can be integrated into a collaborative computing environment. Section 4 adds a 
couple of extended examples of multi-user, multi-application collaboration. Section 5 discusses 
related work and Section 6 presents our summary. 



1. Patent pending 



i 



2.0 ESP — A Mechanism to Enable Concurrent Application Control 

The current graphical user interfaces GUIs [7] are based on a single-threaded dialog, where the 
user operates on one single command button to invoke one application and to execute one single 
function at a time. There has been a need to have a mechanism to sense user commands and dia- 
logs, then to control, manage, and multicast them to a set of applications for executing some func- 
tions simultaneously. ESP provides this mechanism without requiring any modification to existing 
system or application software in a distributed environment. 

ESP has the following features: 

• Built on a multiple client-single server model and uses standard window interface. There are 
no special libraries, and there is no need to compile or even re-link the existing application 
source code. This mechanism should be portable to all window systems. 

• Provides event sensing and multicast mechanisms to make selected processes execute 
concurrently. 

• Provides a concurrent dialog to a group of applications to execute multiple functions 
simultaneously. 

• Provides layered GUI interface to allow graduated access to the more powerful and 
complicated GUI capabilities. 

• Provides a simple point-and-click approach to dynamically connect or disconnect selected 
existing applications in a working context. 

• Provides dynamic context for multicasting. 

• Provides dynamic button assignment for heterogeneous applications. 

2.1 ESP Architecture 

ESP is built on a multiple client-server model. Figure 1 illustrates the overall architecture. It 
contains two key components: 

1 . Concurrency Control Window (CCW) 

2. Event Sense and Muticasting Processing (ESP) 

FIGURE l.ESP Architecture 




2 



Applications use windows to communicate with users. Users request an application to perform a 
function by sending events, such as a result of a key, mouse button, or sprite motion. The 
application GUI behavior understands its window events and how to interact with the application. 
This mechanism is started by sensing all the possible events happening on an application window. 

In general, there are two types of windows commonly used by applications to interact with users: 
buttons and text fields. The event type received on these windows are button press, button release, 
key press and mouse movement. Our mechanism senses the common events and puts them into the 
following two categories: keyboard events and mouse events. 

Both keyboard and mouse events contain key press/release and button press/release. Both events 
are sent by the server to the application when the user presses a key on the keyboard or presses the 
mouse. Only an application that has specifically asked to be informed of that type of event will 
receive them. 

ESP senses and selects the above standard common window events as the potential candidates to 
be managed and multicast. However, additional events may need to be included for processing 
special types of new application windows. 

2.2 ESP Operation 

ESP operation comprises the following four major steps: 

2.2.1 Access Running Application Windows 

The purpose of accessing running application windows is to find the window characteristics, their 
child windows, and their identifiers. This mechanism uses window query tree system calls, window 
images, or pointer movement to find the corresponding application window structure and its 
identifiers. Normally, buttons and text fields are the children of the application main window. 
Buttons and text fields are the input windows used by users to interact with the application. 

2.2.2 Build CCW and Child Windows 

This mechanism builds the Concurrency Control Window (CCW) to control and manage the 
activities in the event sensing and multicasting session. In order to receive the same event types, 
the CCW has to simulate/build the same type of child windows as the application, such as a button 
to represent the application's button, a command line input field to represent the application text 
window. 

2.2.3 Sense User Window Events 

Using a CCW (Concurrency Control Window), this mechanism senses the user actions/window 
events and passes them to the corresponding application windows to execute some function 
concurrently. We use Xt Intrinsics multiple input application contexts and registered event handlers 
in our current X Windows prototype. Similar methods can be applied to other window systems. 
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2.2.4 Multicast User Events 

After building the same type of window, ESP controls and multicasts the window events received 
from the end user to the selected windows to execute a user command such as simultaneously 
changing the sales tax rate in all participating spreadsheets. Using this mechanism, the user can 
concurrently manage the existing application's execution. 

There are two modes of operation: 

1. Global: 

Using CCW window global buttons to invoke all or a selected group of the running 
applications to perform a function simultaneously. 

2. Local: 

Using CCW's individual selection buttons to raise a specific window for executing an 
independent function. 

Methods for using these modes include the following: 

In the global mode, the application GUIs may be structured in a hierarchical fashion with an 
arbitrary number of levels. Interacting with a GUI leads to window events being generated on one 
or more of the children GUIs. The mapping of window events to children GUIs is many-to-many. 
The children applications may be sequential or concurrent. Window events on children GUIs lead 
similarly to further events being generated on grandchildren GUIs. As an example of multiple 
events being generated for a single child GUI, a show button on the CCW window could be 
mapped to both the data visualization and performance visualization buttons on the same child 
visualization application. 

Child GUIs may be specialized to different functions available in different applications or even 
different components of the same sequential application. In the local mode, ESP allows the inde- 
pendent construction of the different functional GUIs. A user can interact with the CCW window 
and access some generic functionality or access the individual children GUIs for access to spe- 
cialized functions. For example, a parallel application could be constructed out of a Lotus 123 
spreadsheet application, Pablo [2] (a performance visualization application), and a mail program 
in order to do some spreadsheet calculations on raw performance data, graph computed visualiza- 
tion data, and mail it to a distribution list. Specialized GUIs are provided by the individual appli- 
cation GUIs whereas generic and composite functionality is provided by the CCW window. 



2.3 ESP Operations Extensions 

ESP operation includes the following three extensions: 
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2.3.1 Dynamic Window Association 

ESP can associate the generic windows, such as buttons, text fields, with a specific window in the 
same or different applications. In addition, a group of application buttons can be associated with a 
generic CCW button. With ESP, users can dynamically associate their CCW buttons with any 
heterogeneous function they want to perform at run time. 

2.3.2 Event Concurrency Control, Grouping, and Synchronization 

ESP uses a layered structure to concurrently control the selected events happening in the existing 
application GUIs. One single user request may generate multiple dialogs with the applications 
which depend on the structure of the application GUIs. 

The user can dynamically add or delete an application window to or from the working context. 
Each application window is only used for processing local application functions. 

ESP allows the user to synchronize different types of window events to manage the order of 
execution by the applications. When associating the application buttons with a CCW button, the 
user can indicate the order of execution and which group of functions can execute simultaneously. 

2.3.3 Enabling Full-Screen Applications 

This mechanism can be also used on the applications that allow the user to interact with the screen 
anywhere on it. Example interactions include text insertion, button press etc. We make our CCW 
an exact replica of the application window. Putting the replica directly under one of the instances 
of the application window and setting the focus to the replica, makes it looks to the user like 
everything typed in one application window is simultaneously being typed in all instances of the 
application in the current context. 

3.0 Integration of ESP with Collaborative Computing 

Human endeavors involve more than one person, especially in a complex distributed and parallel 
application. Different expertise is need to work on different parts of a program. At times, it is 
appropriate to examine a single running instance of an application with multiple experts. For 
example, we can use HP SharedX to allow more than one person to observe and control the pro- 
gram on a remote workstation. However, there is no global control mechanism to allow users to 
control concurrently the interactions with a set of shared applications. 

ESP solves the problem of how to sense user actions, to control, and to distribute input window 
events to each selected application window for simultaneous execution in a distributed 
environment. The integration of ESP with collaborative computing achieves the goal of sharing 
multiple applications among users. It contains three key components: 

1 . Concurrency Control Window (CCW) 

2. Event Sense and Muticasting Processing (ESP) 
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3. Intercept requests from application clients and map them onto the other servers [11] 
Figure 2 illustrates the integration of ESP with collaboration. 

FIGURE 2. An Integration of ESP and Collaboration 
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4.0 Two Examples 

The examples presented here show how potential applications might use ESP. The first example is 
IVD [1], a Hewlett-Packard Laboratories Interactive Visualization Debugger working prototype 
for message passing parallel applications. 



The second example is a commercial banking application that coordinates multiple spreadsheets, 
a plotting program, and a mail program. 



4.1 A Parallel Visualization Debugger 

PVM (Parallel Virtual Machine) [6] is a popular system for writing parallel applications. PVM 
requires a debugging interface that will support debugging functions for all selected processes 
using a single window. Such an interface will be very valuable for actions such as simultaneous 
single-step execution in all selected instances. At present, PVM uses a separate window for each 
selected process. Users have to enter commands in each debugging window. 

IVD uses ESP to integrate existing debuggers and visualizers to debug/observe parallel applica- 
tion execution behavior in a distributed, shared-nothing multicomputer environment with the fol- 
lowing features: 
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4.1.1 Heterogeneous processing 

ESP is designed for heterogenous processing. Using ESP input event sense and multicasting capa- 
bility enables IVD to access existing debuggers across various platforms to debug large, complex 
problems. 

4.1.2 Scalability by grouping 

ESP is scalable. Each process has its own debugging and visualization facilities. ESP provides 
different "contexts" for the user to dynamically select the multicasting scope. By sending com- 
mands only to processes within the single, currently active context rather than all processes simul- 
taneously, the user may limit the amount of overhead generated by running multiple processes. 

With a context, the user can group certain debugger instances together. Each context then receives 
the same debugger commands entered on the IVD concurrency control window. By grouping the 
debugger instances, the user can indicate the execution order of groups of debugger instances. The 
user is not restricted to work only with individual debugger instances. This becomes useful when 
certain debugger instances are naturally associated with one another. For example, one context may 
contain all the slaves while another context contains only the master. Alternatively, there may be 
many contexts, each containing the debugger instances that are functionally related to one another. 
A given debugger may belong to more than one context or to no context. 

4.1.3 Dynamic Context Modification 

Figure 3 illustrates the mechanism to dynamically group a set of application programs to run 
simultaneously. 

A user starts a visualization and debugging session in a parallel programming environment. The 
user starts up the session by bring up a Concurrency Control Window. Afterwards, the user 
performs the following actions: 

1. Presses the "Debug" button in CCW to start the debugging environment and bring up the 
"Debugger 1" window. 

2. Types "debug program 1" in the debugger window command line. 

3. "Program 1" spawns "program 2" and "program 3" running under "debugger 2" and 
"debugger 3" respectively on different workstations. 

4. Context Add debugger 1, debugger2, and debugger 3 in the current debugging context. 

5. Uses global buttons on the CCW window to issue go, step and other debugger command. 

6. Types setting breakpoint, print commands in the multicast command line. 

7. Steps through application 1,2,3 concurrently to locate the problem. 

8. Press global "step" button to simultaneously single-step through the programs 1, 2, 3. 

9. ContextDelete "Debugger 2" from the working set. Step through programs 1 and 3. 

10. ContextAdd "Debugger 2" to the current working set. 
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11. Types "print a" to ask programs 1, 2, 3 to print the variable "a" in their windows 
simultaneously. 

12. Press "Go" button to ask program 1, 2, 3 to continue execution simultaneously 
Repeat the above global/local debugging operations until the bug is found. 



FIGURE 3. Dynamic Context Modification 
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4.1.4 A Multi-User collaborative Debugging Session 

Figure 4 shows that IVD with ESP appears as a single visualization debugger to the user. ESP 
manipulates the user input events, such as keyboard and mouse, and then multicasts these events 
to each running visualizer and debugger. ESP automatically triggers the visualizers/debuggers to 
execute received events from the user. User events are processed as if the window events had been 
directly entered into the visualizer/debugger windows. 

HP SharedX allows more that one person to share and control a single running instance of multi- 
ple applications including IVD. Hence a systems expert and the program developer can work 
together to fix the program. 
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FIGURE 4. Collaborative Debugging 
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4.2 A Commercial Banking Application 

Consider a group of departments that run a spreadsheet each month, plot some graphs, and print a 
report. The normal mode of operation is to have someone build the spreadsheet templates including 
formulas, set up the graphics formats, and build the reports. These are distributed to machines in 
each department. Secretaries in each department enter the department specific data into the work 
space and run the graphs and reports. Any changes to the spreadsheet formulas or report formats 
must be distributed to each department and updates made locally in a timely manner. 

With our mechanism the procedures would be somewhat simpler. The person responsible for 
maintaining the spreadsheet would bring up a control window. The spreadsheet application would 
be started on each node using the facilities of the control window, and a flag would be set to indicate 
that changes made to one machine's copy would be made to all. Next, typing in one window, the 
user would build the spreadsheet, graphics formats, and reports. Changes would be made the same 
way. The department secretaries still enter the department specific data. At the end of each month, 
one person could bring up the spreadsheets on all the machines, set the context so that commands 
are globally executed, and generate all the graphs and reports by typing the commands in only one 
window. 
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FIGURE 5.A Banking Application Association 
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An extension to this example is to use ESP to control completely different applications. The bank 
uses spreadsheets to calculate its annual result, then uses the specific graphic and distribution pro- 
grams to plot, produce reports, and send to all the customer. The generic "Button 1" on the CCW 
associates with Lotus 123 "calculate" button. The generic "Button 2" associates with three but- 
tons: "Graph" on the graphic program window, "Report" and "Send" buttons on the distribution 
program windows. 

It only takes the accountant two steps to distribute the annual reports: 

• Press the "Button 1" to compute the annual income and expense. The results are stored on a 
disk. 

• Press the "Button 2" button to read data from the disk, make the graphic charts, produce the 
annual reports, and distribute the reports to the customers. All three steps are performed by 
pressing the "distribute" button and all three functions are executed concurrently. ESP allows 
the user to define the ordering of the event to be sent. 



5.0 Related Work 

Research in the area of asynchronous collaborative computing has been published under a variety 
of subjects in the recent past. 

The Lotus Notes [8] product allows users to get instant access to the same information without 
any further intervention by sender or the receiver. This is similar to our commercial banking 
example using multiple, shared spreadsheets. There are two differences, however. First, ESP 
allows multiple users to manipulate multiple applications directly and not just update data files. 
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Second, Lotus Notes uses a relational database whereas ESP multicasts uninterpreted window 
events directly to application windows. 

A couple of other systems actually allow concurrent manipulation of multiple applications by 
multiple users. A system from IBM[9] uses global cursor for concurrent data entry and manipula- 
tion in multiple applications. A system from Wang Lab [10] routes keystrokes to processes which 
are associated by reference to a a routing table. However, most of these concurrent control of mul- 
tiple applications have limitations. IBM and Wang's mechanisms require cursor position tracking 
and keystroke interpretation while inputting common data into a plurality of programs. The over- 
head cost is very high in both. ESP provides a concurrency control window to receive, order, 
group and manage incoming window events and to forward them to running applications. ESP 
does not interpret keystrokes and pointer movement. Further, this mechanism can access existing 
application GUIs at run time without changing source code. There is no recompilation, or re-link- 
ing, and no special library is needed 

ESP can be used in many situations. We have already shown that it is the foundation on which to 
build an interactive, parallel, visualization debugger in a distributed and parallel environment. We 
have also noted that this event sense, control, and multicast ESP allows us to mix different 
application GUIs, such as to combine four applications' buttons — calculate, graph, report, and 
send — to generate a report and mail it. In addition to being used in a parallel debugging session, 
this ESP is directly applicable in the following applications: 

1 . Multiple database queries and updates 

2. Networking and manufacturing control 

3. Multiple shadow file creation and updates 

4. Multiple updates for spreadsheet and banking applications 

In general, ESP applies to all GUI applications. It allows us to take any application GUI and to 
make it distributed. It allows the end user to control and manage multiple dialogs with existing 
applications without impact on the existing system or application software. In addition, with 
SharedX, ESP makes collaborative computing possible to control existing applications 
concurrently in a distributed heterogeneous environment. ESP provides a singe image in a multi- 
user, multi-application distributed environment. 

6.0 Summary 

ESP provides unique features not found in other collaborative computing systems as shown in the 
previous sections. IVD uses ESP to provide an on-line interactive visualization debugging tool. In 
addition, ESP has the ability to group processes into various contexts and perform simultaneous 
operations. Programmers can use their favorite tools. The most interesting feature is that many 
different, unmodified tools can be accessed simultaneously by multiple users in a distributed 
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shared-nothing environment. With ESP enabling technology as a vehicle, we will continue to 
explore collaborative computing with multiple applications. 
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